The Listening Machine: Sound Source Organization for Multimedia Understanding
نویسنده
چکیده
Identifying the individual sources present in a real-world sound recording is difficult: Almost without exception, sounds of interest are embedded in a context of competing sounds, and it is rare to be given an unobstructed view of an ideal, isolated target. Human listeners, in common with other auditorily-equipped animals, are adept at handling such mixed signals, but our best computational audition systems — for instance automatic speech recognizers — are highly vulnerable to added interference, even at levels that listeners barely notice.
منابع مشابه
The Effect of Multimedia Glosses on L2 Listening Comprehension
The present study examined the effect of multimedia glosses on foreign language listening comprehension. To this end, 94 male students studying at Rasa English Institute in Tehran were selected for the treatment. The participants consisted of three groups, and each group was randomly assigned to one of the following treatment conditions: textual, pictorial, and textual-pictorial glossing....
متن کاملVirtual Acoustics and 3-d Sound in Multimedia Signal Processing
In this work, aspects in real-time modeling and synthesis of three-dimensional sound in the context of digital audio, multimedia, and virtual environments are studied. The concept of virtual acoustics is discussed, which includes models for sound sources, room acoustics, and spatial hearing. Real-time virtual acoustics modeling is carried out using a real-time parametric room impulse response r...
متن کاملMachine Listening for Context-Aware Computing
Machine listening is an area of study which is rapidly increasing in importance. The proliferation of massive sensory corpora, together with the perceptual needs of smart computational devices and smart spaces has lead to this increase. Machine listening provides both a computationally cheap alternative to machine vision, and a source of information that is complementary to visual information; ...
متن کاملMusical Acoustics and Speech Communication: Musical Pitch Tracking and Sound Source Separation Leading to Automatic Music Transcription II
This paper describes research aimed at building ‘‘active music listening interfaces’’ to demonstrate the importance of music understanding technologies, including sound source separation and F0 estimation, and the benefit they offer to end users. Active music listening is a way of listening to music through active interactions. Given polyphonic sound mixtures taken from available music recordin...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2002